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ABSTRACT. The field of epidemiology has presented fascinating and relevant questions for mathe- 
maticians, primarily concerning the spread of viruses in a community. The importance of this research 
has greatly increased over time as its applications have expanded to also include studies of electronic 
and social networks and the spread of information and ideas. We study virus propagation on a non- 
linear hub and spoke graph (which models well many airline networks). We determine the long-term 
behavior as a function of the cure and infection rates, as well as the number of spokes n. For each n 
we prove the existence of a critical threshold relating the two rates. Below this threshold, the virus 
always dies out; above this threshold, all non-trivial initial conditions iterate to a unique non-trivial 
steady state. We end with some generalizations to other networks. 
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1. Introduction 

1.1. Previous Work. The general problem of studying the propagation of a node-state within a 
large interconnected network of nodes has a wide range of applications across domains, such as 
studying computer virus propagation in computer science, studying the penetration of a meme or 
product in marketing and sociology, and studying the propagation of an infection in epidemiology. 
Many of the earliest investigations HBal IKeWhl IMcKI assume a homogenous network, where each 
node has identical connections to all other nodes: for such networks, the rate of virus propagation 
was then shown to be determined by the density of infected nodes. While mathematically tractable, 
the results in IIFFFl IRiDol IRiFoIal also suggested that such homogenous models fail to represent 
many real networks. There has thus also been work on alternatives to this strict homogeneous 
model. For instance, HP-SVU IP-SV21 IP-SV31 IP-SV41 IMP-SVH study power law networks, where 
the probability of a node having k neighbors is proportional to A;~ 7 for some exponent 7 > 0. Al- 
though more realistic, HWKEI shows that even this model is not well-suited for many real networks. 
Moreover, an issue with these results is that their models, describing the propagation of node-states, 
themselves are dependent on the network topology. In contrast to these, HWDWFM proposes a more 
natural topology-agnostic model that relies on local node interactions. Specifically, their proposed 
SIS (Susceptible Infected Susceptible) model is a discrete-time model where each node is either 
Susceptible (S) or Infected (I). A susceptible node is currently healthy, but at any time step can be 
infected by its infected neighbors. At any time step moreover, an infected node can be cured and 
go back to being susceptible. The model parameters are 0, the probability at any time step that 
an infected node infects its neighbors, and 5, the probability at any time step that an infected node 
is cured. A central set of questions given this model for propagation of a node-state through the 
network are: 



(1) Given a set of model parameters and a particular initial state, does the system then reach a 
steady state? 

(2) If the system does reach a steady state, what are the characteristics of that state? 

(3) What is the dynamical behavior (rate of convergence) of the system? 



For the SIS model, Wang et al. HWDWFII gave a heuristic argument for a sufficient criterion for 
the node infection probabilities to converge to a trivial solution, so that the infection dies out. Using 
a reasonable approximation to eliminate lower order terms, they conjecture a sufficient condition 
for the virus to die out. For star graphs, this condition is b < (1 — a)/ y/n, where a — 1 — 6 and 
b = (3. One of the main contributions of this paper making this argument rigorous. Indeed, given the 
nonlinear coupled dynamics of the SIS model, it is typically intractable to argue rigorously about 
asymptotic state characteristics. But for star graphs, we are able to show that the SIS model exhibits 
phase transition behavior, and moreover that this threshold is both necessary and sufficient. Thus, 
below this threshold the virus dies out, and above the system converges to a non-trivial steady state 
independent of the initial state (provided only that the initial state is non-trivial). One consequence 
of this is that even if a single spoke node is infected initially, so long as the model parameters lie 
beyond the phase transition point, the infection will not die out (i.e., the node infection probabilities 
will not converge to the trivial point). We prove our results through a novel two-step argument, 
by first reducing the problem to one with a smaller graph size, and then applying the intermediate 
value theorem to the dynamics over the reduced graph. 
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Figure 1 . Star graph with 1 central hub and n spokes. 

1.2. Problem Setup. Y. Wang, C. Deepayan, C. Wang and C. Faloutsos HWDWFM proposed the 
following propagation model. Denote by 0, the probability at any time step that an infected node 
infects its neighbors, and by 5, the probability at any time step that an infected node is cured. 

If p-i it is the probability a node i is infected at time t, the SIS model is governed by the following 
equation: 

l - Pu = (l - Pi,t-i) d,t + $Pi,t(i,t, (l-i) 

where is the probability that a node i is not infected by its neighbors at time t. We can express 
Q it as follows: 

Cm = YlPj t-i (1 - P) + (1 -Pj,t-x) = ~ (L2) 

(where j ~ % means i and j are neighbors — i.e., are connected by an edge of the graph). Given the 
non-linear coupled form of this system, a closed form expression for p i>t for the general topology 
case seems infeasible. 

We therefore consider a specific graph topology, that of a star graph (see Figure [T]). This is a 
graph in which there is a single "hub" node which is connected to all the other nodes, the "spokes." 
Suppose the graph has n + 1 nodes: the hub is numbered and the spokes are numbered 1 through 
n. 

Proposition 1.1. For any initial configuration, as time evolves all the spokes converge to a common 
behavior. 

Proof. (11.11) becomes 

n n 

po,t = i - (i - po,t-i) n (i - Pj,t-i) - 8po,t n (i - Ppj,t~i) 

3=1 j=l 

Pi,t = 1 - (1 - Pi,t-i) (1 - /3po,t-i) - &Pi,t (1 - PPo.t-i) , 1 < n < n + 1. (1.3) 

We can immediately observe that all the spokes assume identical values quite rapidly. We prove 
this below by showing that for i, j ^ 0, \p ijt — Pj jt \ — > as t — > oo. We have 

Pi,t ~ Pj,t = (Pi,t-i ~ Pj,t-i) (1 - 0Po,t-i) ~ 5 {pi, t - Pj,t) (1 - 0Po,t-i) 
( 1 - 0p o ,t-i 



S(l-0p 



Pi,t-i - Pj,t-x- (1-4) 



Thus we have 



Since the quantity to the t th power cannot stabilize at 1 as the denominator is at least 1 + 5 and the 
numerator is at most 1, the right-hand side in ( 11.51) decays to as t — > oo. □ 
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(1.6) 



F(x,y) 



(1.7) 



An important consequence of this observation is that it allows us to simplify our model to a model 
in terms of x t , the probability that the hub is infected, and y t , the probability that a spoke is infected. 
These then evolve according to 

f xt+i \ = F ( x t 

V vt+i ) \yt 

where 

" fi(x,y) \ ( l-(l-x)(l-Py) n -8x(l-p y y 
f 2 (x,y) J V l-0--y)0--Px)-8y(l-Px) 

1 - (1 - ax) (1 -by) n 
1 - (1 - ay) (1 - bx) 

recall that we have defined a := 1 — 5 and b := (3 to simplify the algebra. 
1 .3. Main Results and Consequences. Our main result is the following. 

Theorem 1.2. Let a,b G (0, 1) and F as in (11. 7b describes the limiting behavior of the spoke and 
star network. 

I. Ifb < (1 - a)/y/n, then 

(a) the unique fixed point of F is (0, 0), and 

(b) the system converges to this fixed point, that is, the virus dies out. 

II. Ifb > (1 — a)/ y/n then, so long as the initial configuration is not the trivial point (0, 0), 

(a) F has a unique, non-trivial fixed point (xf, yf), where Xf and y/ are functions of a, b and n, 
and 

(b) the system evolves to this non-trivial fixed point. 

Remark 1.3. In the notation of HWDWFU . the critical threshold for the epidemic is (3/5 < I/X^a, 
where X^a is the largest eigenvalue of the adjacency matrix A of the network. For a star graph 
with n spokes connected to the central hub, Ai ^ = \/n. Recalling our a = 1 — 5 and b = (3, their 
condition is equivalent to b = ( 1 — a) / yfn, exactly the condition we have. 

While previous work suggested the veracity of the above claim, it was through heuristic argu- 
ments and numerical simulations. We opted for a theoretical investigation, so as to lend additional 
plausibility to the general conjecture and to develop some techniques potentially useful for eventu- 
ally resolving it. 

The proof of this theorem is distributed over the next few sections. In we prove parts 1(a) and 
11(a) by determining the fixed points of F . Using convexity arguments, we show that the trivial fixed 
point is the only fixed point if b < (1 —a)/ \/n, but there is a unique, additional fixed point for larger 
b. We prove 1(b) in §(3j namely that for b < (1 — a)/ \Jn (so b is at or below the critical threshold) all 
initial configurations evolve to the trivial fixed point. The proof involves linearly approximating the 
map F near the trivial fixed point and controlling the resulting eigenvalues. Finally, we show 11(b) 
in §E1 where we prove that all non-trivial initial configurations converge to the unique non-trivial 
fixed point when 6 > (1 — a)/ y/n. This last case is handled by noting that there is a natural partition 
of the domain [0, l] 2 of F into four regions (see Figure [3]), where the partitions are induced from 
functions related to determining the location of F's fixed points. The analysis of F on all of [0, l] 2 
is complicated, but the restrictions of each region lead to F having simple behavior in each region. 
We end with a discussion of the rate of convergence and the restriction of F to these regions in §|5j 
and discuss some generalizations to other graph topologies. 

2. Determination of Fixed Points of F 



In this section we determine the behavior of the fixed points of the system as a function of the 
parameters a, b and n, proving Theorem 11.21 1(a) and 11(a). The proof relies on some auxiliary 
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Figure 2. Partial fixed points from <pi and <p 2 when (from left to right) b < (1 — 

a)/s/n, b = (1 - a)/y/n, b > (1 - a)/y/n(b = 3,n = 4, a = .1, .4, .7). 

lemmas, which we first show. Specifically, the proofs look for partial fixed points, namely points 
where either the x or y-coordinate is unchanged. We prove that the set of partial fixed points can be 
defined by continuous functions 0i and (fi 2 , whose intersections are the fixed points of the system 
(see Figure[2]). 

We begin with the following lemma characterizing these curves. 

Lemma 2.1. Consider the map F given by (11.71) . 

(1) There exists a continuous, twice dijferentiable convex function <pi : [0, 1] — > [0, 1] such that, 
for each y G [0, 1], there is ay' G [0, 1] with F(<f>i(y), y) = {<pi{y), y'). 

(2) There exists a continuous, twice dijferentiable concave function (p 2 '■ [0, 1] —> [0, 1] such 
that, for each x G [0, 1], there is an x' G [0, 1] with F(x, 020*0) = ^(x)). 

Proof. We define 

9l (x,y) = (l-(l-ax)(l-by) n )-x (2.1) 

and 

g 2 (x,y) = (l-(l-ay)(l-bx))-y. (2.2) 
We first analyze the set of pairs (x,y) G [0, l] 2 where g± (x,y) = 0. We immediately see 
that 0i (0,0) = 0, gi(0,y) > for y G (0,1], and gi(l,y) < for y G [0,1]. Thus by the 
Intermediate Value Theorem, for each y G (0, 1] there is a number (which we denote by <fri(y)) 
such that gi(4>\(y),y) = and (f>i(y) G [0, 1]. It is easy to see that (f>i(y) is a continuous and 
differentiable function of y; in fact, 

\-{l-by) n 



Xv) 



1 - a(l - by) n 
nb(l -a)(l -by)^ 1 



(2.3) 



(1 -a(l -by) n ) 2 

Note (j>i(y) G [0, 1]: it is clearly positive, and > 1 for c > only when a > 1. As a, b G (0, 1), 
4>[(y) > 0. Thus 4>i(y) is strictly increasing. 
We analyze g 2 (x, y) = similarly. We find 

g 2 (x,y) = (l-(l-ay)(l-bx))-y = 0. (2.4) 

Note 02 (0, 0) = 0, g 2 (x, 0) > for x G (0, 1], and g 2 (x, 1) < for x G [0, 1]. Solving yields 

bx 



y 



We can rewrite this as a function of y as follows: 

x = (j) 2 (y) -- 



1 — a + afex 



(l-a)y 
6(1 - ay)' 



(2.5) 



(2.6) 
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This is clearly continuously differentiable, and 

Thus (f> 2 (y) is an increasing function of y. 

We now prove that <f>\ (y) is convex and (f> 2 (y) is concave. Straightforward differentiation and 
some algebra gives 

b 2 n(l - a)(l - by) n - 2 ■ (n - 1 + ail - by) n + a(n + 1)(1 - by) n ) 

MV) = (l-a(l- W 3 < ° 

2a (1 

6(1 - ay) c 

Thus 0i (y) is convex while <p 2 (y) is concave. Direct inspection shows each function is twice 
continuously differentiable. □ 

The next lemma is useful in determining the number and location of fixed points of our map F. 



<t>i{v) = 777 A > °- ( 2 - 8 ) 



Lemma 2.2. Let hi, h 2 be twice continuously differentiable functions such that hi (x) is convex 
and hi (x) is concave. If there exists some p such that h[ (p) < h' 2 (p) and h± (p) = h 2 (p), then 
hi (x) 7^ h 2 (x) for all x > p. 

Proof. As hi (x) is convex and h 2 (x) is concave, h[ (x) is decreasing and h 2 (x) is increasing. 
Thus, since h[ (p) < h' 2 (p), h[ (x) < h 2 (x) for all x > p. As hi (p) = h 2 (p), this implies that 
hi (x) < h 2 (x) for all x > p. □ 

We now determine the location of the fixed points. 

Proof of Theorem 17.21 1(a). Note that 

<P[ (0) = & (0) = i^. (2.9) 



From these equations, we can see that <f> 2 (0) > <fi'i (0) when b < (1 — a)/ ^Jn. Thus by Lemma [2T2l 
when b < (1 — a)/ y/n, there is no y > such that (pi (y) = <\> 2 (y). The trivial fixed point is thus 
the unique fixed point in [0, l] 2 . □ 

We next prove that for b > (1 — a)/y/n, there exists a unique non-trivial fixed point. The key 
ingredient is the following lemma. 

Lemma 2.3. Let hi,h 2 : [0, 1] — > [0, 1] be twice continuously differentiable functions such that 
hi(x) is convex, h 2 (x) is concave, hi(0) = h 2 (0) = and hi(x) ^ h 2 (x) for x > sufficiently 
small. Then there exists at most one other x > Ofor which hi(x) = h 2 (x). 

Proof. The claim is trivial if there is only one point of intersection, so assume there are at least two. 
Without loss of generality we may assume p > is the first point above zero where hi and h 2 agree. 
Such a smallest point exists by continuity, as we have assumed hi(x) ^ h 2 {x) for x > sufficiently 
small; if there are infinitely many points x n where they are equal, let p = liminf n x n > 0. 

Because hi{x) is convex, h\(x) is increasing. By the Mean Value Theorem there is a point 
ci G (0,p) such that 

, hi(p) - hi(0) hi (p) 

hi(ci) = = . (2.10) 

p — p 

As h[ is increasing, we have h' x {jp) > hi(ci); further, h[(x) > hi(ci) for all x > p. As h 2 (x) is 
concave, h 2 (x) is decreasing. Again by the Mean Value Theorem there is a point c 2 e (0,p) such 
that 

v h 2 (p) - h 2 (0) h 2 (p) 

h 2{C2) = ^ = , (2.11) 

p — p 
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h' 2 (p) < h 2 (c 2 ), and h' 2 (x) < h' 2 (c 2 ) for all x > p. But since hi (p) = h 2 (p), h[(ci) = h' 2 (c 2 ), so 
h[(x) > h' 2 (x) for all x > p. Thus we know from Lemma |2T2l that there cannot be another point of 
intersection after p. □ 

We are now ready to complete the analysis. 

Theorem \1.2\ 11(a). We first prove existence and then uniqueness. When b > (1 — a)/ y/n, we know 
from the proof of Theorem 1 1 .21 1(a) (see (12.91 )) that X (y) is above </> 2 (y) near the origin since 
4>[ (0) > 02 (0). The existence of the non-trivial point of intersection follows from the Intermediate 
Value Theorem. We recall that y = 4>2(x) is defined in [0, 1] for all x £ [0,1], and x = (f>i(y) is 
defined in [0, 1] for all y £ [0, 1]. As x — > 1 we have 4> 2 {x) tends to a number strictly less than 1. 
Thus the curve y = 02 (z) hits the line x — 1 below (1,1). Similarly the curve x = 4>i(y) hits the 
line y — 1 to the left of (1, 1). Thus the two curves flip as to which is above the other, implying that 
there must be one point where the two curves are equal. 

We now have two fixed points, the trivial fixed point and the non-trivial fixed point from the 
second intersection of the two curves. By Lemmas |2 . 1 1 and [231 there are no other fixed points, and 
thus there is a unique, non-trivial fixed point. □ 

3. Dynamical Behavior: b < (1 - a)/y/n 

In this section we show how an eigenvalue perspective can completely determine the dynamics if 
b < (1 — a)/ y/n, proving Theorem ll.2l 1(b). As these methods fail for larger b, we adopt a different 
perspective in §0] 

3.1. Technical Preliminaries. Our analysis of the dynamical behavior relies on the following 
lemma. 

Lemma 3.1. Let a, b £ (0, 1) with b < (1 — a)/ \fn, and let \\ > \ 2 denote the eigenvalues of the 
matrix ( ^ n ^j? J, where a, /3, 7, 5 £ [0, 1]. Then —1 < A 1; X 2 < 1. 

Proof. The sum of the eigenvalues is the trace of the matrix (which is a(a + 5)), and the product 
of the eigenvalues is the determinant (which is a 2 a5 — nb 2 ^). Thus the eigenvalues satisfy the 
characteristic equation 

A 2 -a(a + 5)X + (a 2 a5-nb 2 f3-f). (3.1) 

The eigenvalues are therefore 

a(a + 5)± \Ja 2 (a + 5) 2 - A(a 2 a5 - nb 2 ^) a(a + S) ± \/a 2 (a - 5) 2 + Anb 2 ^ 

2 = 2 • ( } 

As the discriminant is positive, the eigenvalues are real. Since a(a + 5) > 0, we have |A 2 | < Ai, 
where 

a(a + 8) + sJa^a-Sf + Anb 2 ^ 
U S Ai — . {J-J) 

As /?7 < 1, nb 2 < (1 — a) 2 and y/u + v < \fu + ^/v for u, v > we find 

a(a + 5) + ^a 2 (a - 5) 2 + ^4(1 - a) 2 



Ai < 



2 

a(a + 5) + a\a - 5\ + 2(1 - a) 
2 

2a max(a, 5) + 2(1 — a) 
2 

= 1 - (1 -max(a,5))a < 1, (3.4) 
where the last claim follows from a, a, S £ [0, 1] . □ 
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3.2. Proofs. Armed with the following, we now prove the first half of our main result, the dynam- 
ical behavior at or below the critical threshold. 

We prove the claim by using the Mean Value Theorem and an eigenvalue analysis of the resulting 
matrix. From Theorem 1 1.2[ 1(a) we know (0, 0) is the unique fixed point. We have 

f(( u Y\ ( l-(l-au){l-bvT \ 
1 \\ v J J ~ \ l-(l-av)(l-bu) J' 



Let 



c(t) = (!-*)( S)+*m, c(t) = (y )■ (3-6) 



Thus c(t) is the line connecting the trivial fixed point to ( X J , with c(0) = ( n ) anc * 



. Let 



y r y ' o 



™ - /<««)) - ( \ ) ■ <") 

Then simple algebra (or the chain rule) yields 

, _ ( a(l - bty) n nb{\ - atx){l - bty)^ 1 \fx\ 
* K ) ~ \ b{\ - aty) a(l - btxu) ) \ y )' 

We now apply the one-dimensional chain rule twice, once to the x-coordinate function and once 
to the y-coordinate function. We find there are values t\ and t 2 such that 

f(( x \\-f(f°\\ f a(l-bt iy ) n nb(l-at 1 x)(l-bt 1 y) n - l \(x\ 
\\yj) 1 \\0 J) ~ { b(l-at 2 y) a(l-bt 2 x) AW 

To see this, look at the x-coordinate of Tit): h(t) = 1 — (1 — atx)(l — bty) n . We have h(l) — h(0) 
= h(l) = ti(tx)(l - 0) for some t x . As 



h'ih) = ax{\ - buy)" 1 + nby{\ - ahx)^ - bhy)' 



n-l 



= (a(l - bt iy ) n , nb(l - at lX )(l - bhy)^ 1 ) i X y \, (3.10) 

the claim follows; a similar argument yields the claim for the y-coordinate (though we might have to 
use a different value of t, and thus denote the value arising from applying the Mean Value Theorem 
here by t 2 ). We therefore have 

( ( x W f a ( l ~ ht iy) n nh ( l ~ atxx){l - bt iy ) n - 1 \ ( x 

\.\yJJ ~ \Kl-at 2 y) a(l-bt 2 x) ){y 

= A{a,b,x,yMM){l\ (3-11) 

To show that / is a contraction mapping, it is enough to show that, for all a, b with b < (1 —a) / y/n 
and all x, y E [0, 1] that the eigenvalues of A(a,b,x,y,ti,t 2 ) are less than 1 in absolute value; 
however, this is exactly what Lemma [3TT1 gives (note our assumptions imply that a = (1 — btiy) n 
through 5 = (1 — bt 2 x) are all in (0, 1)). Let us denote A max (a, b) the maximum value of Ai for 
fixed a and b as we vary t\, t 2 , x, y G [0, 1]. As we have a continuous function on a compact set, 
it attains its maximum and minimum. As Ai is always less than 1, so is the maximum. Here it 
is important that we allow ourselves to have t±, t 2 E [0, 1], so that we have a closed and bounded 
set; it is immaterial (from a compactness point of view) that a,b E (0, 1) as they are fixed. As 
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FIGURE 3. The four regions determined by 0i and </>2 when b > (1 — a)/ \fn. 



< a,b < 1, we have a, f3, 7, 5 < 1 and thus the inequalities claimed in Lemma l3Tl hold. For any 
matrix M we have 1 1 Mv 1 1 < | A max \\\v\\; thus 

f((l)) < A max (a,6) 



as A max (a, b) < 1 we have a contraction map. Therefore any non-zero 



(3.12) 



iterates to the trivial 



fixed point if b < (1 — a)/ y/n and n > 2. In particular, the trivial fixed point is the only fixed point 
(if not, A(a, b, x, y, t\, t 2 )v = v for v a fixed point, but we know \ \A(a, b, x, y, t±, t2)v\\ < \\v\ \ if v 
is not the zero vector). 

Remark 3.2. Unfortunately this eigenvalue approach does not work in a simple, closed form man- 
ner for general b > (1 — a)j \fn. We include details of such an attempted analysis in Appendix 
B. 



4. Dynamical Behavior: b > (1 - a)/s/n 



In this section we prove Theorem 11.21 11(b), establishing convergence to the non-trivial fixed 
point. 

4.1. Properties of the Four Regions. Unfortunately, the method of eigenvalues does not seem to 
naturally generalize to large b. While it is possible to compute the eigenvalues of the associated 
matrix, it does not appear feasible to obtain a workable expression that can be understood as the 
parameters vary; however, breaking the analysis of F into regions induced from the maps 0i and 2 
of §[2] turns out to be very fruitful. This is because these curves determine partial fixed points. See 
Figure [3] for the four regions. 

We first study the effect of F in Regions I and III. Our first lemma provides some general infor- 
mation about the image of these regions under F, which we then use to show in the next lemma that 
F maps each of these Regions I and III to themselves. 

Lemma 4.1. Let b > (1 — a)/ y/n. Points in Region I strictly increase in x and y on iteration by F, 
and points in Region III strictly decrease in x and y on iteration. 

Proof. A point (x, y) in Region I satisfies the inequalities 

x < y - p- (4.1) 

1 - a(l - by) n 
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and 

bx 

y < z — -r-. (4.2) 

1 — a + abx 

By multiplying by the denominator on both sides for both inequalities, we find that 

x — ax (1 — by) n < 1 — (1 — by) n 

y — ay + abxy < bx. (4.3) 

Rearranging these terms gives 

x < 1 - (1 - by) n + ax(l - by) n = 1 - (1 - ax)(l - by) n = h (x, y) (4-4) 

and 

y < ay + bx — abxy = 1 — (1 — ay)(l — bx) = f 2 (x, y) . (4.5) 

Thus, the x and y coordinates of the iterate of a point in Region I are strictly greater than the x and 
y coordinates of the initial point. 

The proof for points in Region III is exactly analogous except with the inequalities flipped. Thus 

x > K - fV (4-6) 

l-a(l-by) n 

and 

y > (4-7) 

1 — a + abx 

imply that 

x > 1 - (1 - ax)(l - by) n = fx (x,y) (4.8) 

and 

y > l-(l-ay)(l-bx) = f 2 (x,y), (4.9) 

i.e., the x and y coordinates of the iterate of a point in Region III are strictly less than the x and y 
coordinates of the initial point. □ 



Lemma 4.2. Let b > ( 1 — a) / yjn. The image of Region I under F is contained in I, and the image 
of Region III under F is contained in Region III. 

Proof. We prove that for a point (x, y) in Region I, its iterated x-coordinate satisfies (14.11 ) and its 
iterated y-coordinate satisfies (14.21) . 

^-Coordinate Iteration: 

We must show that 

^-■»-^< ;;a;a:3i; . <«* 

We'll do this by first showing the left hand side is less than Yzz^z^yi > 1 — (1 — ax)(l — by) n , 
which we then show is less than the right hand side. 
Since (x, y) is in Region I, we know that 

x < 1 - (1 -ax)(l -by) T \ (4.11) 

which implies that 

< 1. (4.12) 



1 - (1 - ax)(l - by) n 
Since < a, b, y < 1, we know that a(l — by) 11 > 0. Thus, 



1 " X{1 ~ b " r r , > 1 - a(l - byT. (4.13) 

1 - (1 -ax)(l -by) n y y> v > 
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We simplify the left side of the inequality: 

1 - (1 -ax)(l -by) n ax(l-b y y 



1 - (1 - ax)(l - by) n 1 - (1 -ax){\ -by) 1 
1 - (1 - by) n + ax{\ - by) 71 ax(l - by) n 



1 - (1 - ax)(l - by) n 1 - (1 - ax)(l -by) n 

1 - (1 - by) n 



> l-a{l-by) n 

> l-a(l-by) n 

> l-a(l-by) n . (4.14) 



1 - (1 -ax)(l - by}' 
Finally, we rearrange the inequality, and obtain our intermediate step: 

1 _ (1 _ fa,)n 

T _^ I -W_ >!-(!- (4.15) 

For the second part of the proof, recall that 

y < l-(l-ay)(l-bx), (4.16) 

which implies 

(1 _ 6(i _ (i _ ay )(i _ bx )))n < (1 _ by y (4 1?) 

Now we let (1 - b(l - (1 - ay){\ - bx))) n = c and (1 - by) n = c + S where < c < 1 and 5 > 
such that c < c + 5 < 1 . Then we can write 

-5 < -a5 

1 — c — S — ac + ac 2 + acS < 1 — c — ac + ac 2 — aS + aSc 
(1 - ac)(l - c - 8) < (1 - ac - a5){l - c) 

1 -< + 4 ' < (4.18) 



Thus 



1 — a(c + 5) 1 — ac 

1 _ (1 _ 6(1 _ (1 _ ay )(l - bx))) n > !-(!-%)" _ (4 J9) 



1 - a(l - 6(1 - (1 - - te))) n 1 - a(l - by) n 
The desired result follows from (14.151) and ( 14.191) . 



y-Coordinate Iteration: 

We must show that 

1 - (1 - „)(! - fa) < "'-"-"'"-W,,. (4.20) 

V yJK 1 1 — a + ab(l — (1 — ax){l — by) n ) V 

We argue similarly as before, first showing the left hand side is less than 1 _( 1 _ a b ^( 1 _ 6a; ) > which we 
then show is less than the right hand side. Since (x, y) is in Region I, we know that 

y < \-{l- a y){l-bx), (4.21) 

which implies that 

< 1. (4.22) 



1 - (1 - ay)(l - bx) 
Since < a, b, x < 1, we know that abx — a < 0. Thus, 



y(abx — a) „ , 

!+i 77^ w-, , \ > 1-a + abx. (4.23) 

1 — (1 — ay)(l — bx) 
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We simplify the left side of the inequality: 

1 — (1 — ay)(l — bx) y(abx — a) 

+ ^ w '— ; > 1-a + abx 



1 - (1 - ay)(l - bx) 1 - (1 -ay)(l - bx) 

ay + bx — abxy abxy — ay 

1 - (l-ay)(l-bx) + I - (I - ay)(l - bx) 

bx 



> 1 — a + abx 

> 1 - a + abx. (4.24) 



l-(l-ay)(l-6x) 
Rearranging the inequality yields our intermediate step: 

bx 

> i _ (i _ ay )(i _ 6x ). (4.25) 

1 — a + aox 

For the second part of the proof, recall that for a point in Region I 

x < 1 - (1 - ax)(l - by) n . (4.26) 

This allows us to write 1 — (1 — ax)(l — by) n = x + c for some c > such that x < x + c < 1. 
Since c > and a, 6 < 1 we see that 

be — abc > 

bx + be — abx — abc + ab 2 x 2 + ab 2 xc > bx — abx + afr 2 x 2 + ab 2 xc 

b(x + c)(l - a + abx) > bx(l - a + ab(x + c)). (4.27) 



Thus 



that is, 



6(X + C) > (4.28) 



1 — a + a6(a; + c) 1 — a + abx ' 
6(1 _ (i - ax)(l - 6j/)») > to 



l-a + a6(l-(l-aa;)(l-&y) n ) 1 - a + abx 
The desired result follows from (14.251) and ( 14.291) . 

The proof showing that all points in Region III iterate inside Region III under F is essentially the 
same, now taking (14.81 ) and (14.91 ) as the initial inequalities. Thus given a point in Region III, we find 
that its iterated x-coordinate satisfies (14.61 ) and its iterated y-coordinate satisfies (I4.7I ). □ 

4.2. Limiting Behavior. Before proving Theorem 11.21 11(b) in general, we concentrate on the 
special case when the initial state is in Region I or III. 

Lemma 4.3. Let b > (1 — a)/ y/n. All non-trivial points in Regions I and III iterate to the non-trivial 
fixed point under F. 

Proof. Consider any non-trivial point z = (xq, yo) in Region I. Define a sequence by setting z t +i = 
F (z t ). By Lemma |4~T1 we know that z t is monotonically increasing in each component, and is 
always in Region I. Furthermore, we know that z t is bounded by (x/, yf) (the unique, non-trivial 
fixed point). Thus, z t must converge. Suppose it converges to z', i.e., lim^oo z t = z' . We consider 
the iterate of z' . Since F is continuous, we have 

F(z') = F(limzt) = limFOt) = lim z t+1 = lim z t = z'. (4.30) 

Thus, z' is a fixed point. Since z > (0, 0) and z t is increasing, z' cannot be the trivial fixed 
point. Thus z' must be the unique non-trivial fixed point. For Region III, we have a monotonically 
decreasing and bounded sequence z t that must thus converge to a fixed point. By Lemma [4721 this 
fixed point must be in Region III and thus can only be the unique non-trivial fixed point. □ 
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4.3. Proofs. The essential idea is the following. Consider any rectangle in [0, l] 2 whose lower left 
vertex is not (0, 0) (the trivial fixed point introduces some complications, but we can bypass these 
by simply taking larger and larger rectangles). Assume the lower left and upper right vertices are in 
Regions I and III respectively. We show that the image of this rectangle under F is strictly contained 
in the rectangle by showing that the image of the lower left (respectively, upper right) point has both 
coordinates smaller (respectively, larger) than any other iterate. As the lower left and upper right 
vertices iterate to the non-trivial fixed points (since they are in Regions I and III), so too do all the 
other points in the rectangle, as the diameters of the iterations of the rectangle tend to zero. 

We make the above argument precise. Let the rectangle be all points (x, y) E [0, l] 2 with xe < 
x < x u and y e < y < y u . Recall F(x,y) = y), f 2 (x, y)). We choose a point (x,y) 

in our rectangle and let z 0jl (x,y) = x and z 0t2 (x, y) = V- We define the sequence z t (x,y) = 
(z til (x,y), z ti2 (x,y)) (t a positive integer) by z t+ljl (x,y) = fx(z ti i(x,y), z ti2 (x,y)) and z t+li2 (x, y) 
= f2{z t ,i(x,y), z tj2 (x,y)). We show by induction that z t>1 (x e ,yt) < z til (x,y) < z t:1 (x u ,y u ) and 
Zt,2(xe,ye) <z ty2 (x,y) < z ty2 (x U) y u ). In other words, the image of any of our rectangles is contained 
in the rectangle, and the lower left vertex iterates to the lower left vertex of the new region (and 
similarly for the top right vertex). 

The base case is given by our choice of (xe, ye) and (x u , y u ), so we proceed to show the inductive 
step. Suppose that we have z tA (x £ , y e ) < Zt,i(x, y) and z t)2 (x £ , ye) < z t>2 (x, y). Then 

1 - azt A (x e , ye) > 1 - az ti i (x, y) 

l-bz t>2 (x e ,yi) > z tt2 (x,y), (4.31) 

which implies that 

(l-az tA (xe,yi))(l-bzt !2 (x e ,ye)) n > (1 - az tA (x, y))(l - bzt, 2 (x, y)) n (4.32) 
for any n > 1 . Then 

1 - (1 - az t ,i(xt, yt)){\ - bz t , 2 {x e , y e )) n < 1 - (1 - az t>1 (x, y))(l - bz t . 2 {x, y)) n . (4.33) 
That is, z t+lt i(xi, ye) < z t+ i :1 (x, y). Furthermore, we have that 

1 - az ti2 (x e , ye) > 1 - az t , 2 (x, y) 

1 - bz t> i{x£, y t ) > 1 - bz t> i{x, y), (4.34) 

which implies that 

(1- az tj2 (xe,ye)){l-bz t> i{x e ,y e )) > (1 - azt >2 (x, y))(l - bz tjl (x, y)). (4.35) 

Then 

1 - (1 - aztfl(x t ,y e ))(l - bzt^x^ye)) < 1 - (1 - az t>2 (x,y))(l - bz t>1 {x,y)). (4.36) 

That is, z t+lj2 (xt,ye) < Zt +li2 (x,y). 

By a similar argument, we see that Zt tl (x,y) < z t>1 (x u ,y u ) and z t , 2 (x,y) < z tj2 {x w y u ) implies 
that z t+lt i(x,y) < z t+ljl (x u ,y u ) and z t+h2 (x,y) < z t+lt2 (x u ,y u ). 

Thus z tt i(xe,ye) < z t ,i(x,y) < z t ,i(x u ,y u ) and z t>2 (xe,ye) < z t , 2 (x,y) < z t , 2 (x u ,y u ) for all 
t E N. Taking the limit, we have 

lim z t ,i(xg,yi) < lim z t> i(x,y) < lim z t ,i(x u , y u ) (4.37) 

t— >co t— >oo t— >co 



and 



lim z t , 2 (xe,ye) < lim z t , 2 (x,y) < lim z t 2 {x u , y u ) (4.38) 

t— >oo t— >oo t— >oo 

Since (xe, ye) is in Region I and (x u , y u ) is in Region III, the inequalities become 

Xf < lim z tj i(x, y) < Xf (4.39) 
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and 

y f < Jim Zt,2(x,y) < y f . (4.40) 

Thus lim^oo 2^1(2;, y) = x f and lim^^ Zt >2 (x, y) = y f , that is, (x, y) iterates to (x f , yf). 

We can isolate from the proof Theorem 1 1.2[ 11(b) information about the rapidity of convergence. 

Corollary 4.4. Assume b > (1 — a)/y/n. Given a point (x,y) £ (0, l) 2 , consider a rectangle 
with (x, y) on the boundary and vertices (x\, y\) in Region I and (xm, ym) in Region III. Then the 
amount of time it takes for (x, y) to converge to the unique, non-trivial fixed point is the maximum 
of the time it takes (xi, y\) and (im, ym) to converge. 



5. Future Research 

While we are able to determine the limiting behavior of any configuration, a fascinating question 
is to understand the path iterates take when converging to the fixed point. Based on some numerical 
computations and some partial theoretical results, we make the following conjecture. 

Conjecture 5.1. Let b > (1 — a)j ^pri. Points in Regions II and IV exhibit one of two behaviors, 
depending on a,b, n. Either: 

(1) All points in Region II iterate outside Region II and all points in Region IV iterate outside 
Region IV ("flipping behavior"), or 

(2) All points in Region II iterate outside Region IV and all points in Region IV iterate outside 
Region II ("non-flipping behavior"). 

It would be interesting to find simple conditions involving a, b and n for each of the two possi- 
bilities. 

Another topic for future research is to apply the methods of this paper to more general models. 
We present some partial results to a system which quickly follow from our arguments. We may 
consider star graphs with more than two levels, i.e., graphs whose spokes are themselves surrounded 
by additional spokes, which might themselves be surrounded by additional spokes, et cetera. We 
recall that (11.11) and (11.21) give us the following general system: 

Pi,t = (1 - Vi,t-i) Yl i 1 - + S Pi,t II (! - PPj,t-i) 

= 1 - (1 - ap i)t _i) J] (1 - bp jt ^x) . (5.1) 
i~» 

We keep the simplifying assumption that at each level, the number of spokes is the same. In the 
3-level case, this means that we consider a graph with ni spoke nodes around a hub node, and n 2 
spoke nodes around each of the ni spokes. Generalizing our result in the 2-dimensional case that in 
the limit all spokes have the same behavior, we can argue by induction that all nodes on the same 
'level' approach a common, limiting value. Thus, in the f-dimensional case, we are reduced to a 
system in I unknowns. 

We first consider the 3-dimensional case. If we let x t be the probability that the hub is infected 
(the level 1 node), y t be the probability that a spoke of the hub is infected (the level 2 nodes), and 
z t be the probability a spoke of a spoke is infected (the level 3 nodes), (15.11) gives us the following 
system: 

1 - (1 - ax) (1 - by) ni 
1 - (1 - ay) (1 - bx) (1 - bz) n2 I . (5.2) 
1 - (1 - az) (1 - by) 











y 


H 
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We again look for partial fixed points by solving 

x = fi(x,y,z) 



y = h ( x i v, z ) 

z = h(x,y,z), (5.3) 



which gives the following surfaces: 

01 (y, z) = x 

02 0, z) = y 



l-o(l-6y) B1 

1 - {1 - bx) {1 - bz)" 2 
1 - ay (1 - bx) (1 - bzf 2 



<P 3 (x,y) = z = \- -. (5.4) 

1 — a + aby 

If we take the intersection of 0i with the plane defined by 3 and 2 with the plane defined by 
03, we get two curves that look a lot like our curves from the original (2-dimensional) case. We can 
express these curves in terms of x and y. The first curve is already done. For the second, we can 
write 

y-\ 1 
b (1 - ay) (1 - bz) n2 + b' 
Since we know that z = by/ (1 — a + aby) we can write this as 

y- 1 1 
HI ~ ay) (l-j^) b 

We now have two curves, 0i (y) and 02 (y). If we take their derivatives at 0, we obtain 

bri\ 



x = — —tt, rrwj + t- ( 5 - 5 ) 



0'i (0) 



1 — a 



a' (n\ (1 - a) 2 - b 2 n 2 

02 = j-y- r • (5.7) 

o (1 — a) 

Doing some analysis on their second derivatives shows that 0" (y) < and 2 ' (y) > for all 
y G [0, 1]. Thus 0i (y) is convex and 2 (y) is concave. All the pieces are now in place to argue as 
in the proof of Theorem 1 1.21 1(a) and 11(a). We find that there exists a unique nontrivial fixed point 
if and only if 

0; (0) > 0' 2 (0) , (5.8) 

i.e., 

1 — a 

b > (5.9) 

\/ni + n 2 

This leads to the following conjecture (which is known for £ = 2 or 3). 

Conjecture 5.2. Consider a generalized spoke and star graph with t levels. Level one consists of 
one node (the hub), level two consists ofn\ spokes connected to the central hub, and for each node 
of level k there are nk nodes connected to it (and these are the level k + 1 nodes). There is a unique, 
non-trivial fixed point if and only ifb > (1 — a)/ y/n\ + • • • + ngZi. 
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The following appendices highlight some of the approaches we took to 

tackling the problem. The first appendix describes in detail a mostly 

trivial analysis of the n — 1 case, while the second appendix gives 

an eigenvalue approach to the problem, and the final appendix discusses 

some topological approaches to the problem which unfortunately did 

not lead to a complete solution. We include these in the arxiv version 

in case they may be of use to others investigating similar problems. 



Appendix A. Special Case: n = 1 

The dynamical behavior can be directly determined in the special case n — 1. Unfortunately, 
this is a very degenerate case, and many of the ideas and approaches here cannot be generalized 
to higher n, though some can (and in fact the analysis here was helpful in guessing some of the 
general behavior). In this case, it suffices to consider a one-variable problem, namely f(x) = 
1 — (1 — ax)(l — bx). This is because when n = 1 we cannot distinguish a spoke from the central 
node. 

A. 1 . Fixed Points. We know from our main result that there is a unique non-trivial fixed point, but 
we show the proof of that result again here for the special case. 

Lemma A.l. The fixed points of f are and a+ a 6 b ~ 1 . If a + b < 1 there is only one fixed point in 
[0, 1], namely 0. If a + b > 1 then there is a second fixed point in (0, 1). 

Proof. We have 

f(x) — x = 1 — (1 — ax){\ — bx) — x 

= —abx 2 + (a + b)x — x 

= x (abx — (a + b — 1)) 

= abx ( x - a + - — -J. (A.l) 



ab 

As the fixed points are when f(x) — x — 0, the first half of the lemma is clear. 

We must show a+6 ~ 1 e (0, 1). Clearly we need a + b > 1; thus in this case a+6 ~ 1 > 0. To show 
it is at most 1 it suffices to show a + b — 1 < ab or a + b — 1 — ab < 0. As a < 1 we have 

a + b — 1 — ab = a — ab + b — 1 

= a(l-6)-(l-6) 

= (a - 1)(1 - b) < 0. (A.2) 

□ 

A.2. Derivative. Recall f(x) = 1 — (1 — ax)(l — bx). Thus 

Lemma A.2. If a + b < 1 then \f'{x)\ < 1/2 for all x; ifa + b>\ then f'(x) > for all x. 
Proof. We have 

f'(x) = a(l - bx) + 6(1 - ax) 
= (a + b) — 2abx 

= ab[ a ^--2x\. (A.3) 



ab 

Note the first derivative is decreasing with increasing x 



VIRUS DYNAMICS ON STARLIKE GRAPHS 17 

If a + b < 1 then 

= \ a + b-2abx\ < |l/2-(a + 6)| < 1/2 (A.4) 

(note a + b < 1 implies ab < 1/4). 

Assume now a + b > 1. When x = we have /'(0) = a + b > 1. When x = 1 we have 
f (1) = a + b-2ab. Note 

a + 6-2a6 = a-ab + b-ab = a(l - 6) + 6(1 - a) > 0. (A.5) 
Thus the first derivative is always positive. □ 

Remark A.3. A trivial argument could be used to show that ifa + b < 1 then we have a contraction 
map, and everything converges to the trivial fixed point. Thus we shall always assume below that 
a + b > 1, i.e., that we have a non-trivial, valid fixed point. 

Lemma A.4. Ifa + b> 1 then we have f'(l) < I. 

Proof. This follows immediately from 

f'(l) = a(l - 6) + 6(1 - a) < 1-6 + 6 = 1. (A.6) 

□ 

The reason it is important to note that f'(l) < 1 is that we want to show that / is a contraction 
map, at least for a subset of [0,1]. Let x / denote the fixed point a+6 b ~ 1 ■ By the Mean Value Theorem 
we have 

f(x)-f(x f ) = f(t)(x-x f ), £e[x f ,x]; (A.7) 

if x < Xf then we should write [x,x/] for the interval. As /(x/) = Xf, we can easily see what 
happens to a point x under /: 

x -> f(x) = x f + f\Z)(x-Xf). (A.8) 

Thus if x starts above Xf then f(x) is above xj (because the derivative is always positive and 
x > xj); if x starts below Xf then f(x) is below Xf (because the derivative is always positive and 

X < Xf). 

This suggests that we should think of / as a contraction map; the problem is we need to show 
the existence of a 5 E (0, 1) such that < 1 — 5. If this were true, then by the Mean Value 

Theorem we would immediately have / is a contraction. Unfortunately, the derivative can be larger 
than 1; for example, when x = we have /'(0) = a + 6 > 1. Thus for a small interval about x = 
we do not have a contraction. 

We can determine where / is a contraction. We must find x c such that f'(x c ) = 1; as /' is 
decreasing then the interval [x c + e, 1] will work for any e > 0. We have 

1 = f'( Xc ) = a + b-2abx c (A.9) 

implies 

— T = -l 

Lemma A.5. Let a + 6 > 1. The first derivative is decreasing on [0, 1]; thus its maximum is 
/'(0) = a + 6 > 1 and its minimum is f'(l) < 1. Further, f'(x) > lforx G [0,x c ), f'(x c ) = 1 and 
f'(x) < lforx e (x c , 1]. Note f'(x) > 0. 

Proof. That f'(x) is decreasing follows from (IA.3I) : the claims on f'(0) and f'(l) are immediate 
from the other lemmas. The rest follows from our choice of x c . □ 
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A. 3. Dynamical Behavior. Remember we define x c so that f'(x c ) = 1. Further f'(x) is monoton- 
ically decreasing. 

Theorem A.6. Let xq E (0, 1] and assume a + b > 1. Let x m+ i = f(x m ). Then limm-^ x m — Xf, 
where x / is the non-trivial, valid fixed point. 

Proof. If x = then all iterates stay at 0. For any e > 0, if x E [x c + e, 1] then / is a contraction 
map, and the iterates of x converge to Xf, the unique non-zero fixed point. As this holds for all 
e > 0, we see that the iterates of any x E (x c , 1] converge to Xf. 

We are left with x E (0,x c ]. As f'(x) is always greater than 1 on (0, x c ), if x E (0, x c ] then 
f(x) > x. The proof is straightforward. By the Mean Value Theorem we have 

f(x) = f{0) + f(£)x, ee(o,4 (A.ii) 

It is very important that £ E (0, x c ) and not in [0, x c ]. The reason is that f'(x) > 1 in (0, x c ) but 
f'(xc) = 1 (see Lemma IA31) . As /(0) = we have for all x E (0, x c ) that 

fix) = + /'(£)* > x. (A.12) 
If for some x E (0, x c ] an iterate is in (x c , 1] then by earlier arguments the future iterates converge 

to Xf. 

Thus we are reduced to the case of an x E (0, x c ] such that all iterates stay in (0, x c ] . We claim this 
cannot happen. As this is a monotonically increasing, bounded sequence, it must converge. Specif- 
ically, fix an x E (0,x c ). Let x\ = f(x) and in general x m+ \ = f(x m ). Assume all x m E (0, x c ) 
(if ever an x m = x c then x m+ \ = f(x c ) > x c = x m and the claim is clear). Thus {x m } is a 
monotonically increasing bounded sequence, and hence (compactness or the Archimedean prop- 
erty) converges, say to x < x c . By continuity, lim^^ x m+1 = Mm^^ f(x m ) = filim^^ x m ), 
or x = f(x). As x > 0, it must equal the unique, non-trivial fixed point, which cannot happen 
as we are assuming that all iterates are at most x c . Thus some iterate exceeds x c , completing the 
proof. □ 

Remark A.7. Note the above proof required us to be very careful. Specifically, we used the fact 
that f'(x) > lfor x E (0, x c ] to show that such x are repelled from the fixed point 0, and then we 
used the fact that f'(x) < 1 for x E (x c , 1] to show such points are attracted by the non-zero fixed 
point Xf. Arguments of this nature can be generalized. 

Appendix B. Eigenvalue Approach to Fixed Points and Dynamics 

We continue the eigenvalue approach of §[3] to determining the nature of the fixed points. The 
following lemma will be useful. 



Lemma B.l. Let a,b E (0, 1), and set 

* - ( 

b a 



a nb 



Then the eigenvalues of A are a + by/ri, with corresponding eigenvector ^ j, and a — by/n, 



^ n j. We may write any vector ( X 



x \ / y , x \ I Jn \ y x \ l -Wn 



y J \2 2y/nJ V 1 / V 2 2 V™J V 1 
Ifb > (1 — a) l ' y/n then a + by/n > 1. 



(B.2) 
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Proof. The above claims follow by direct computation. It is convenient to write A as 



A = al + by/7i[ 17 V V ) = al + b^iB, (B.3) 





as the eigenvalues and eigenvectors of B are easily seen by inspection. □ 
Remark B.2. The two eigenvectors are linearly independent, and thus a basis. Note that any vector 



X y J with positive coordinates will have a non-zero component in the ( ™ J direction. 

While we were able to explicitly compute the eigenvalues and eigenvectors here, we will not need 
the exact values of the eigenvectors below. From the Perron-Frobenius theorem we know that the 
largest (in absolute value) eigenvalue is positive and the corresponding eigenvector has all positive 
entries (because all entries in our matrix are positive). 

Theorem B.3. Assume n > 2, a, b e (0, 1) and b > (1 — a)/ y/ri. Then there is a p = p(a, b, n) > 
such that ifv=( X )^(^) has \ \v\\ < p then eventually an iterate ofvbyf is more than p 



. y ) V . 

units form the trivial fixed point. In other words, the trivial fixed point is repelling. 

Proof. We must show that if | \v\ | is sufficiently small then there is an m such that | \f m (v ) 1 1 > \ \v\ 
where f 2 (v ) = f(f(v)) and so on. 



We have 

u \ \ ( 1 - (1 -au){\ -bv) 7 

v J ) ~ I 1 - (l-av){l-bu) 



a nb \ ( u \ „ ( ( u 2 + v 2 

°a,b,n „,2 , J • (B.4) 



/ 



b a J V v J a ' b ' n \ \ u 2 + v 

In other words, there is some constant C (depending on n, a and b) such that the error in replacing 

(u \ ( a Tib \ ( u \ ( u 

J by the linear map A = I ^ I acting on I I is at most CI 

Thus if f ™ ^ has small length, the error will be negligible. 

To show that eventually an iterate of v = ( ^ ^ is further from the trivial fixed point than v, we 

argue as follows: we replace / by A, and since one of the eigenvalues is greater than one eventually 
an iterate will be further out. The argument is complicated by the need to do a careful book-keeping, 
as we must ensure that the error terms are negligible. 

Let Ai = a + b\Jn > 1 and A 2 = a — by/n (note | Aa | < Ai as we have assumed a,b > 0). We 
may write A = 1 + 77, with < r\ < y/n. Our goal is to prove an equation of the form 



/(-)(,) =K (V- + *\(^\ + v(l-*\l -V" I + smaU . (B.5) 



2 2y/E) V 1 / V 2 2 VnJ V 1 

We often take m even, so that \™ is non-negative. We may write x = r cos 9 and y = r sin 9, with 
r < p (later we shall determine how large p may be). 

We introduce some notation. By E{z) we mean a vector ( Z ^ ^ such that \z\\, \z%\ < z. Let 

v = v and v k+1 = f(v k ). Thus 

Vl = f(v ) = Av + E(Cr 2 ), (B.6) 

as ||t'o|| 2 = r 2 ; here E(Cr 2 ) denotes our error vector, which has components at most Cr 2 . If 
I \v\ 1 1 > r then we have found an iterate which is further from the trivial fixed point, and we are 
done. If not, | \v\\\ < r. 
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Assume \ \vi\\ <r. Then 

V2 = f( Vl ) = A Vl + E(Cr 2 ). (B.7) 

But Av\ = Av + AE(Cr 2 ), with E(Cr 2 ) denoting a vector with components at most Cr 2 . As the 
largest eigenvalue of A is Ai, we have AE{Cr 2 ) = E(XiCr 2 ). Thus 

v 2 = A 2 v + EiXiCr 2 + Cr 2 ). (B.8) 

If 1 1 V2 1 1 > r we are done, so we assume | \v 2 \ \ < r. Then 

v 3 = f(v 2 ) = Av 2 + E(Cr 2 ). (B.9) 

But Av 2 = A 3 v + AE{X l Cr 2 + Cr 2 ). As 

AE{X x Cr 2 + Cr 2 ) = E(\\Cr 2 + X^r 2 ), (B.10) 

we find 

v 3 = A z v + E(X\Cr 2 + AiC7r 2 + CY 2 ). (B.ll) 
If there is some m such that \ \v m \\ > r then we are done. If not, then for all m we have 

v m = A m v + E (j2 X \ Cr ^j = A ™ v o + E ■ Cr 2 ^j . (B.12) 

Using Lemma IB . 1 1 (writing v = v as a linear combination of the eigenvectors and applying A) 
yields 



+ E (Kjzl. C r 2 ). (B.13) 



Ai 

We shall consider the case x > y; the other case follows similarly. Let m be the smallest even 
integer such that A" 1 > 10;asAi < 1 + yfn < 2y/n we have for such m that A™ < 40n. We 
consider the x-coordinate of v m . As m is even and x > y the contribution from 



is at least AT" ■ > 5a:; the contribution from E ■ Cr 2 ) is at most ^4 ■ Cr 2 < ^ ■ Cr 2 

< 40 ^ rn • r. By assumption, r < p. Let p < 400 Q Cn • Then the x-coordinate of v m is at least Ax 
(since x > y, x > r/y/2). Thus ||v m || 2 > 16x 2 > 8(x 2 + y 2 ) = 8\\v \\ 2 = 8r 2 , which contradicts 
I \v m \ I < r for all m. 

If instead y > x then the same choices work, the only difference being that we now look at the 
y-coordinate. □ 

Numerical exploration suggested the following conjecture (which is Theorem 1 1.21) . 

Conjecture B.4. Let n = 2 and assume a,b e (0, 1) with b > (1 — a)/y/n. The map f is a 
contraction map in a sufficiently small neighborhood of the unique non-trivial valid fixed point 

Vf — I X f J. Thus, ifv—i X ) is sufficiently close to Vf, then the iterates ofv converge to Vf. 



Vf j \y 

While the eigenvalue approach is unable to prove the above, other techniques fared better (and in 
the main body of the paper we proved this by geometric arguments involving partial fixed points). 
Unfortunately the linear approximation of / near the non-trivial valid fixed point Vf is a horrible 
mess, involving numerous complicated expressions of a and b. While we can clean it up a bit, it is 
not enough to get something which is algebraically transparent. 
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When n = 2we have 



bxi 



Using / 



x f 
Vf 



Vf 

x f 
Vf 



yields 



bxi 



abxt 



Vf 



x f 



a )Vf 



b{l-ay f )' 



(B.15) 



ay f 



x f 



axf 



(B.16) 



These relations can help simplify some of the formulas; the problem is the formula for Xf in terms 
of a and b is a nightmare (and remember this is the 'simple' case of n = 2!): 



x f 



2a 3 + b 3 - 2a 2 (2 + b) + a(2 + 2b - 2b 2 ) - by/b A + 4a(l - b)(a - 1 - bf 

2ab(a 2 + b 2 - a(l + 2b)) 



(B.17) 



A, 



(B.18) 



The resulting fixed point matrix is 

a(l-by f ) 2 2b(l-axy f )(l-by f ) 
6(1 — ayj) a(l — bxf) 

We want to show the largest eigenvalue is less than 1 in absolute value when b > (1 — a)/ y/2. 

We know that the critical line is b = (1 — a)/ \[2 = 1/ \[2 — a/ \[2. A good way to numerically 
investigate the eigenvalues of Af is study the eigenvalues along the line b = (m — a )/y/2, with 
1 < m < 1 + \/2. This gives us a family of parallel lines. For a given (valid) choice of m, we have 
max(0, m — y/2) < a < 1. Below (Figures |4] through [8]) is an illustrative set of plots of the largest 
eigenvalue for 5 different choices of m. 



0.2 0.4 0.6 O.S 1.0 



FIGURE 4. Distribution of the largest eigenvalue of Af along the line b 
a) /y/2, withm = 1 + y/2/ 6 w 1.2357. 



m — 



0.4 0.6 



Figure 5. Distribution of the largest eigenvalue of Af along the line b — (m — 
a) /y/2, with m = 1 + 2^/6 w 1.4714. 
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0.4 0.5 0.6 0.7 0.8 0.9 1.0 



FIGURE 6. Distribution of the largest eigenvalue of Af along the line b 
a)/y/2, withm = 1 + 3^/6 « 1.7071. 



im — 



0.7 0.8 0.9 1.0 



Figure 7. Distribution of the largest eigenvalue of Af along the line b 
a) /V2, with m = 1 + 4^2/6 « 1.9428. 



0.85 0.90 0.95 1.00 



FIGURE 8. Distribution of the largest eigenvalue of Af along the line b 
a)/y/2, withm = 1 + 5^/6 « 2.1785. 



m 



It is crucial that m > 1, as m = 1 leads to a coalescing of fixed points (i.e., we have the trivial 
fixed point with multiplicity two, and the third fixed point is not valid). In Figure [9] we plot the 
behavior of 1 — \\{a, 1 — \/2/100), where \\{a, b) is the largest eigenvalue of Af. Note that the 
largest eigenvalue is very close to 1, but always less than 1, for this value of m. 

Note in Figure [9] that Ai is small, especially for large a. This indicates that perhaps when a is 
close to 1 and b = (m — a)/ y/2 that there is a hope of proving the largest eigenvalue is strictly less 
than 1. 

In fact, it is easy to show that if a and b are close to 1, then Xf is close to 1 as well (which 
immediately implies that yf is also close to 1). This implies that the entries of Af are all positive 
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FIGURE 9. Distribution of 1 minus the largest eigenvalue of Af along the line 

b= (m-a)/V2, withm = 1 + v^/lOO « 1.0141. 

numbers close to 0. A simple calculation shows 



Ai(a,6) 



{{l-by } ) 2 + {l-ax f ))a 
2 

v/((l - by f f - (1 - ax/)) a 2 + 86 2 (1 - by f )(l - ax f )(l - ay f ) 



(B.19) 



If a,b,Xf and yj are all close to 1, then \±(a, b) will be small. We have shown 



Lemma B.5. Let n = 2, a, b E (0, 1) and assume b > (1 — a) / \[2. Then if a and b are sufficiently 
large, then f is a contraction map near the non-trivial valid fixed point (i.e., the non-trivial valid 
fixed point is attracting ). 

With some work, using this method we can determine how 'close' a and b need to be to 1. With 
computer assistance, we can partition a and b space, numerically compute the fixed points and 
eigenvalues, and by doing a sensitivity of parameters analysis prove the theorem. 



Appendix C. Injectivity and Topological Arguments 

One approach to this problem is to use topological arguments as a way of showing a contraction 
mapping and thus convergence to a unique non-trivial fixed point. Many of these arguments are 
facilitated by the map being injective; unfortunately, our map is only injective for some values of 
a, b and n. In the injective cases, we can use results from topology to obtain many useful results. 
While these are not used in the proof of our main theorem, we include them as they may assist future 
researchers in studying related questions. As these cannot lead to a complete proof in general, our 
goal is more an exposition of these ideas then including full details. 

Given that we have injectivity in certain special cases, we can analyze the dynamical behavior 
by using results from topology. In the special case with injectivity we can study our map on simple 
closed curves. This gives us the crucial property that our function maps the interior points of a 
simple closed curve to the interior of the image of the curve, and exterior points to the exterior. We 
constantly use the fact that there is a unique, non-trivial fixed point. 

The presence of the trivial fixed point at (0, 0) causes some complications. To simplify the 
analysis, instead of letting Co denote the boundary of the unit square we replace the corner near 
(0, 0) with a semicircular arc from (0, e) to (e, 0). We let C7„ +1 = f(C n ), and note that C„ +1 is 
entirely contained in the interior of C n . For C\, this follows from direct computation; for n > 2 it 
follows from our injectivity assumption. As the fixed point is contained inside C , the fixed point 
is inside C n for all n (it is the only fixed part of the interior and does not move on iteration, and 
thus always remains interior to every curve). This allows us to reducing the proof that all points 
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iterate to the non-trivial fixed point to showing the sequence of boundary curves C n iterate to the 
fixed point. 

We want to prove that the limit of C n is just a point. Unfortunately, the analysis is complicated 
by the fact that it is possible for the boundaries to always contract but not converge to a point. 
We discuss several of the potential obstructions; many of these can be eliminated by using more 
detailed properties of our map. 

If we first assume that the image of every curve is strictly contained in the curve, then standard 
arguments prove that C n converges to the non-trivial fixed point. Consider, instead, the following 
map. It is easier to record what happens to the radius and the angle then the point. For simplicity, 
we assume the non-trivial fixed point is at (0, 0). Given a point (x, y), we write it as (r, 9). Let 



(l + ^,6 + rV2) ifr>l 
(r + r^,9 + rV^) if r < 1. 



(C.l) 



This is an interesting map; the origin is fixed, but all other points eventually iterate to the boundary 
of the unit circle. The origin is the only fixed point (the ta/2 essentially gives us a rotating circle). 
Of course, this map violates many of the properties of our map, in particular it is not a polynomial 
map; however, it does have the property that all boundary curves of the region contract but do not 
converge to the fixed point. 

The example above thus tells us that the analysis of the dynamical behavior must crucially use 
properties of our map, and cannot follow from general topological facts about continuous maps. 
Remember that the functions (j) x and 2 divide the outer boundary of our square into four sub- 
regions, which are our Regions I-IV. We know that Regions I and III (save for the trivial fixed 
point) converge to the fixed point after successive iteration and always remain inside themselves. 
If any part of Region IV iterates into Regions I or III, it will converge to the fixed point, so we are 
not concerned with that aspect of its behavior. The difficulty is when part of Region IV iterates to 
Region II. Since interior and exterior cannot occupy the same space because these are all simple 
closed curves, all of Region I, III, and IV, would flip to outside Region IV (to see this, we separate 
Regions I, III and IV into one closed curve and Region II into another). Unfortunately, this leads 
to a complicated analysis where we start asking how many times we can have iterates of a point 
in IV in IV before entering II; because of these technicalities, we turned to other approaches. The 
interested reader can contact the authors for additional maps and examples. 
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